Fast Collective Communication Libraries, Please

نویسندگان

  • Prasenjit Mitra
  • David G. Payne
  • Lance Shuler
  • Robert van de Geijn
چکیده

It has been recognized that many parallel numerical algorithms can be eeectively implemented by formulating the required communication as collective communications. Nonetheless, the eeciency of such communications has been suboptimal in many communication library implementations. In this paper, we give a brief overview of techniques that can be used to implement a high performance collective communication library, the iCC library, developed for the Intel family of parallel supercomputers as part of the InterCom project at the University of Texas at Austin. We compare the achieved performance on the Intel Paragon to those of three widely available libraries: Intel's NX collective communication library, the MPICH Message Passing Interface (MPI) implementation developed at Argonne and Mississippi State University and a Basic Linear Algebra Communication Subprograms (BLACS) implementation, developed at the University of Tennessee.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance evaluation of communications and computations on CRAY T3E

We present in this paper an evaluation of the performances of communication and computations on the CRAY T3E. We have implemented all the main communication schemes (point-to-point and collective ones) using all the message passing libraries available on the target machines. We have also implemented two basic numerical kernels (FFT and CG) in order to evaluate the computation performances and t...

متن کامل

PVM and MPI Communication Operations on the IBM SP2: Modeling and Comparison

Most current message passing programs use the portable communication libraries PVM and MPI to realize communication. In this paper, we investigate the performance of single transfer operations and several collective communication operations, like broadcast or gather operations, using the portable communication libraries PVM and MPI on the IBM SP2. Our investigations include timings of different...

متن کامل

The collective computing model

The parallel computing model presented in this paper, the Collective Computing model (CCM), is an improvement of the well-known Bulk Synchronous Parallel (BSP) model. The synchronicity imposed by the BSP model restricts the set of available algorithms and prevents the overlapping of computation and communication. Other models, like the LogP model, allow asynchronous computing and overlapping bu...

متن کامل

Tuning MPI Collectives by Verifying Performance Guidelines

ABSTRACT MPI collective operations provide a standardized interface for performing data movements within a group of processes. The e ciency of collective communication operations depends on the actual algorithm, its implementation, and the speci c communication problem (type of communication, message size, number of processes). Many MPI libraries provide numerous algorithms for speci c collecti...

متن کامل

Eecient Collective Communication on Heterogeneous Networks of Workstations 1

Networks of Workstations (NOW) have become an attractive alternative platform for high performance computing. Due to the commodity nature of workstations and interconnects, the NOW environments are being gradually redeened as Heterogeneous Networks of Workstations (HNOW) environments. This paper presents a new framework to implement collective communication operations (as deened by the Message ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995